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RNA^BEMSBENT RNA POL«ffiRASE FUNCTIO^it^REFERABLV ON 

POLYMEk?^E 

5 The subject of the invention is a method of 

transcription, which makes it possible to synthesize 
RNA strands complementary to an RNA template, as well 
as new RHA polymerases which make it possible to carry 

10 out this method. 

The method of the invention leads to the 
amplification of RNA present in small quantities in a 
biological sample, .nd thus allows the detects and or 
^tification of the J^t ^U^til - 
" Tar™ J Jlield of microbiology and virology. 
Z more generally in the field of medical diagnosis^ 
The method of the invention may also be used in the 
synthesis of RNA probes. 
, n It is known that in microbiology and 

virology, the microorganisms which it is sought to 
identify are often viable bacteria 
containing more RNA than DNA, or RKA viruses such as 

Tt- is also known that m 
the HIV and HCV viruses. It is 
25 various pathologies, it is often advantageous to 
monitor the variations in the expression of genes, and 
therefore in the synthesis of messenger RNA. 

It is therefore important to be able to have a 
simple and effective method of amplifying an RNA 

30 " r9St ' The PCR method, which makes it possible to 
cyclically amplify a ONA target, uses a single enzyme 
but retires the production of temperature cycles^ 
generally at three different temperatures. The PCR 
method may be adapted to the amplification of an RNA 
target by adding an additional enzymatic activity of 
^-dependent DNA polymerase, which further complicates 
this method. 
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The so-called NASBA/TMA method of amplification 
has the advantage of being an isothermic method, but 
requires the use of three enzymatic activities (In- 
dependent DNA polymerase, RNase H and DNA-dependent RNA 
5 polymerase) carried by two or three enzymes. 

It is therefore desirable to be able to have a 
simple and automatable method of amplification of RNA, 
and in particular an isothermic method using only one 
enzyme . 

10 to avoid the disadvantages, which have D ust 

been mentioned, of known amplification techniques, it 
therefore appears to be necessary to use, for the 
amplification of RNA, an RNA-dependent RNA polymerase 

activity. 

Unfortunately, the known natural RNA-dependent 
RNA polymerases (RNAd RNAp) are not suitable for such a 
use because they have specific requirements as regards 
the RNA template, and their activity requires the 
presence of protein cofactors (also called auxiliary 
protein factors or associated protein factors) . 

It has now been discovered that some known DNA- 
dependent RNA polymerases are capable of transcribing a 
single-stranded RNA in the presence of a double- 
stranded DNA promoter. Furthermore, some of these 
25 enzymes, which are transformed by mutation, are capable 
of synthesizing a transcriptional product with a better 
yield when the template consists of RNA than when the 

template consists of DNA. 

In the present application, the term 

30 "transcription" designates- the synthesis of several 
strands of RNA in the presence of a polynucleotide 
template and of ribonucleoside triphosphates, in an 
appropriate reaction medium and under conditions 
allowing the catalytic activity of an RNA polymerase to 

35 be exerted. The transcription occurs by synthesis of a 
complementary or antiparallel copy of the template. The 
strand of the template which is copied is called the 
transcribed strand or the template strand. The 
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synthesis of the RNA progresses in the 5 ' 3' 
direction. 

It is known that some RNA polymerases function 
under the control of a promoter. A promoter is a 
5 double-stranded nucleotide sequence recognized for the 
RNA polymerase and necessary for ^> the initxatxon 

of transcription. 

It should be recalled that when the template 
strand is linked to the promoter, the first nucleotide 
10 transcribed on the template strand, linked by its 3 
end to the 5' end of one of the strands of the 
promoter, is designated by + 1. The strand of the 
promoter which is linked to the template strand is 
called the antisense strand. The other strand of the 
15 promoter, which is complementary to the antisense 
strand, and hybridized to it, is called sense strand^ 
The successive nucleotides which are situated on the 
side of the promoter, with respect to nucleotide + 1. 
are , starting from + 1, numbered -1, -2, -3, and the 

20 like. , c, 

The position -1 therefore corresponds to the 

-nd of the antisense strand of the promoter, and to the 
3. end of the sense strand. However, some authors 
include the nucleotide sequence corresponding to the 
25 region where the transcription starts (in particular 
the sequence from + 1 to + 6. for which a consensus 
sequence can generally be defined) in the definition of 
the sequence of the promoter. 

On the template strand, the positions of the 
30 successive nucleotides copied, starting from + 1 and 
therefore in the 3 • - 5 • direction, are noted + 2 . + 3 . 

and the like. 

in the text which follows, the terms sense 
strand and antisense strand are generally used for the 
35 promoter itself (positions numbered negatively), and 
the term non-template strand is used for any strand 
linked to the 3 ■ end of the sense strand, and the term 
template strand for any strand linked to the 5' end of 
REPLACEMENT SHEET (RULE 26) 
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-™h nr for any strand hybridized to 
the antisense strand or ror any , ^ 0 

^ errand In a given polynucleotide 
the non- template strand. my 

strand -upstream region- refers to a reason situated 
on side of the 5- end, and -downstream 

region situated on the side of the 3- end. However « 
tne domain of transcription under the control of a 
Remoter, and without taking into considerate a 
promoter . ups tream- region traditionally 

particular strana, ^ .1 i o 

refers to the region which, relative to position +1. « 
on the side of the promoter (positions indicated by 
TegatTve nt^ers, . and -downstream- region the region 
Ttuatea on the side of the template copied .positions 
indicated by positive numbers, _ ^ 

downstream direction then corresponds to the 3 
direction on the template strand, and to the 5 
direction on the newly- synthesized RN* •««*■ 

The template strand is not necessarily linked 
to the 5' end of the antisense strand of the promoter, 
to the 5 « case , be hybr idized to a 

However, it should. <non-template 
20 complementary and antiparaliel _ 
str and, which is itself linked by its 5 end ^ to th* * 
end of the sense strand of the £°^™_~™ 9W 

nnFTSCH PW., Biochemistry 33, 1492b ±4^* 
and DOETSCH P.W 577 -585 (1995). In such a 

and ZHOU W. et al • , Cell 82. 3 " 

ana ^nuu in any position, 

OS case the transcription may start m any * 
25 case, tn corresponding to the 3' 

which may range from +1 to +24, corr p 

h of the template strand or of the part of 

end of the temp non- template 

template strand hybridized with 



15 



Strand ' „ •«->, bacterial, eukaryotic 

Compared with bacterial, 



or 



30 compare 

mitochondrial RHA polymerases . P > 

polymerases are very simp e enzymes. The be 

4-v,«n, *t-<= the RNA polymerases of tne i , 
among them are the P ^ polyme rase 

SP6 bacteriophages. The Dace y 
35 has been cloned; see in particular patent US 4 952 496 
These enzymes are highly homologous to one another and 
J o7 a single subunit. The natural promoters 
consist of a smgi sp6 
specific for the RNA polymerases of the 
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phages are well known. The sequencing of the whole 
genome of the bacteriophage T7 (Dunn et al . , J. Mol. 
Biol. 166, 477-535 (1983)) has made it possible to 
define the existence of 17 promoters on the DNA of this 
5 phage. Comparison of these 17 sequences shows that 23 
contiguous nucleotides situated between positions -17 
and + 6 relative to the site of initiation (position +1) 
of transcription, are highly conserved. These 
nucleotides are even identical in five so-called class 
10 III promoters, which are the most efficient in 
particular in vitro. Likewise, many promoter sequences 
specific for the T3 RNA polymerase also exhibit a very 
high homology, in particular between positions -17 and 
+6. Moreover, several different sequences of promoter 
15 for phage SP6 RNA polymerase have been identified and 
also exhibit a high homology; see Brown J.E., et al., 
Nucleic Acids Res. 14, 3521-3526 (1986). 

It is therefore possible to consider that the 
various ohage RNA polymerases mentioned above form part 
2 0 of a family of RNA polymerases which recognize 
promoters having a consensus sequence from position -17 
to position +6, and in particular from position -17 to 
position -1 • 

The method of the invention makes it possible 
25 to transcribe any RNA sequence because the RNA 
Polymerases, capable of transcribing RNA under the 
control of a promoter, which are described in the 
present application, may transcribe RNA without a high 
sequence specificity. It is known, nevertheless, that 
30 some sequences for initiation of transcription, in 
particular from position +1 to position + 6, are more 
favorable than others for obtaining transcripts of the 
expected length with a given phage RNA polymerase, in 
the case of the transcription of DNA; see for example 
35 Milligan J.F. et al., Nucleic Acids Research, 15, 8783- 
8798 (1987). The RNA polymerases capable of 
transcribing RNA which are described in the present 
application may also function with variable yields 
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according to the sequence of the region for initiation 
of transcription. The sequences which are most suitable 
for a given RNA polymerase may be determined, where 
appropriate, by simple routine experiments similar : to 
5 those described by Milligan at .1 . in the article which 
has just been mentioned. In addition, as will be seen 
below, the method of transcription of the invents 
mak es it possible, where appropriate, eithe r to start 
the transcription in a favorable region of the RNA to 
10 be transcribed, or to provide a reagent-promoter which 
already contains a region for initiation of 
■ transcription having a sequence favorable for a given 

RNA polymerase. _ 

The subject of the present invention is 
therefore a method of amplifying any RNA target 
sequence, by transcription under the control of a 
promoter, in an RNA sample comprising said target 
sequence , 

in which said sample is brought into contact: 

- with a reagent capable of hybridizing with 
said RNA comprising said target sequence, 

- and with an enzymatic system comprising an 
RNA-dependent RNA polymerase activity, 

under conditions allowing the hybridization of said 
25 reagent with said RNA comprising said target sequence 
and under conditions allowing the functioning of said 
RNA-dependent RNA polymerase activity; 
in which said reagent contains: 

(i ) a first nucleotide strand comprising: a) a 
30 first nucleotide segment capable of playing the role of 
sense strand of a promoter for said RNA polymerase 
activity and b) , downstream of said first segment, ^ a 
second nucleotide segment comprising a sequence capable 
of hybridizing with a region of said RNA, and 

(ii) in the hybridized state on the first 
strand, a second nucleotide strand comprising a third 
nucleotide segment capable of hybridizing with said 



20 
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-p^v-m wi t-h it a functional 
first segment so as to form witn it 

double- stranded promoter; 

and in which said RNA polymerase activity is capable of 
transcribing an RNA template, in the presence of said 
reagent hybridized with said template, in the absence 
of associated protein factor and in the absence of a 

ligase activity. 

The general conditions allowing the 
hybridization of nucleotide strands are known, and 
specific conditions may be easily determined, by 
routine experiments, for strands of a given sequence. 
The conditions allowing the functioning of the RNA 
polymerase activity, in the presence of ribonucleoside 
triphosphates, may also be easily determined by 
experiments, optionally with the aid of the information 
provided in the experimental section below. 

Th e 3' end of the first segment corresponds to 
Position -1 in the transcriptional system used. The 
f<rst segment contains a sufficient number of 
nucleotides to be able, in the hybridized state, to 
play the role of a promoter for an RNA polymerase. 
According to a specific embodiment, the first segment 
contains at least 9 nucleotides. 

in patent FR 2,714,062, it has been shown that 
short sequences of 6 to 9 consecutive nucleotides 
chosen from the -12 to -4 region of the sense strand of 
a promoter for a phage RNA polymerase are capable of 
playing the role of functional promoters xn the 
transcription of a DNA target sequence. 

The reagent used in the method of the invention 

=( _ lpaqt . on e of the following 
may also exhibit at least one 

characteristics: 

- said third segment is flanked, at its 
upstream end, by a fourth nucleotide segment which is 
shorter than said second segment of the first strand; 

- said fourth segment is capable of hybridizing 
with a portion opposite said second segment. 

- said first and third segments consist of DNA; 
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- said third and fourth segments consist of DNA 

or RNA. a _ 
The third segment may have the same length as 

ti- mav also be shorter or longer, 
the first segment. It may aibu ^<= 

but its 5' end must correspond to position -1 (that is 
to say the position immediately preceding the positxon 
for initiation of transcription in the case where the 
template strand is linked to the promoter) , when xt xs 
hybridized with the first segment. 

When the second strand of the reagent does not 
contain the fourth segment, the reagent may be used xn 
particular to transcribe an RNA whose 3' end region, or 
a region close to the 3- end, has a known sequence and 
<n this case the second nucleotide segment of the fxrst 
"strand is constructed so that said RNA, in the vicinxty 
of its 3- end, is capable of hybridizatxon with at 
Wst part of the sequence of said second nucleotxde 
s ;gment P The 3- end of the part of the RNA to be 
transcribed which is hybridized to the second segment 
20 may be contiguous to the 5' end of the third segment 
or .t may be distant therefrom by a number x of 
nucleotides (counted on the second segment), x 
representing zero or an integer from 1 to 24 . Of 
course, the length of the second segment (in number of 
nucleotides) is greater than x, in order to be able to 
ensure the binding of the RNA template to be 
transcribed, by hybridization with a downstream regxon 

of the second segment. 

The fourth segment, containing for example from 
1 to 18 nucleotides, and in particular from 1 to 12 
nucleotides, preferably has a sequence chosen so as to 
favor the initiation of transcription for a gxven RNA 
polymerase (see in particular the experimental sectxon 
below). The fourth segment may be produced xn 
particular in DNA. Its sequence may be complementary to 
the upstream region of the second segment facing xt and 
to which it is then hybridized. In this case, the 

£ c i r pa-i nn of the second 

choice of the sequence of the 5 regxon 
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segment is dictated by the choice of the sequence of 
^fourth segment. It is not necessary for the fourth 
segment to be linked to the third segment since in any 
case its correct positioning in order to favor the 
initiation of transcription may be ensured by its 
hybridization to the second segment. However, in a 
specific embodiment, the fourth segment is linked to 

the third segment. 

As above, the 3- end of the target n» part 
which is hybridized with the second segment may be 
distant from the 5- end of the fourth segment by a 
number of nucleotides equal to x, as defined above. 

For obvious reasons, the second segment 
contains a number of nucleotides at least equal to the 
sum of the number of nucleotides of the fourth segment 
if it is present, and of the number of nucleotides of 
sa.d sequence of the second segment which is capable of 
nyoridizing with said region of the to be 

trSnSCri "he' method of transcription of the invention 
m ay be carried out with a virus or phage wild-type R»A 
polymerase, and in particular with an 

chosen from the family of polymerases, mentioned 

above, which includes the TV Kb» polymerase, T3 RNA 
25 polymerase and SP6 RMA polymerase. 

It has indeed been discovered that these RNA 
polymerases, known to be r. M A-dependent , were also 
capable of transcribing an RK template, optionally 
choosing (for example by virtue of the fourth segment 
described above, a sequence favoring the rnrt.atron of 
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30 



transcription. 

The discovery of this RNA-dependent 



RNA 



polymerase activity makes it possible to have for the 
fir^ time RNA polymerases capable of transcribing RNA 
35 under the control of a promoter, starting from position 
+ 1 and in the absence of associated protein factor. 

It is also possible to carry out the method of 
the invention with mutated RNA polymerases which will 
REPLACEMENT SHEET (RULE 26) 
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be described in greater detail below. The importance of 
these mutated RNA polymerases is that some of them are 
capable of carrying out the transcription with a better 
yield when the template consists of RNA than when the 
template consists of a comparable DNA (that is to say 
containing deoxyribonucleotides A, C, G in place of the 
ribonucleotides A, C, 6. respectively, and containing 
the deoxyribonucleotide T in place of the 

ribonucleotide U) . 

The invention also relates to the use of an RNA 
polymerase capable of transcribing an RNA template, 
"under the control- of a promoter, in the absence of 
auxiliary protein factor, in a method of transcription 
of a template strand comprising an RNA target sequence 
in which said RNA polymerase is chosen from the T7 RNA, 
the SP6 RNA polymerase and the mutated RNA polymerases 
as defined above. The template strand may consist of 
RNA, or may consist of DNA in the region for initiation 
of transcription, and then of RNA next. 

The invention also relates to the use of an RNA 
polymerase capable of transcribing an RNA template 
3 under the control of a promoter, in the absence of 

C ! auxiliary protein factor, in a method of transcription 

la of a template strand comprising an RNA target sequence, 

25 in which said template strand consists of RNA at least 
between position + 5 and the 5' end of the target. The 
template strand therefore consists of RNA starting from 
one of the positions + 1 to + 5, and may therefore 
consist of DNA from position +1 to position +2, +3 or 
30 + 4 The article by S. Leary et al . mentioned below 
describes the transcription of a template consisting of 
DNA for positions + 1 to +6 and then of RNA for 
positions 7 and the next ones, with T3 RNA polymerase. 

The invention also relates to the RNA-dependent 
35 RNA polymerases (RNAdRNAp) obtained by modification of 
DNA-dependent RNA polymerases and which are capable of 
synthesizing RNA strands complementary to an RNA 
template. They may be -used. for example, m the 
REPLACEMENT SHEET (RULE 26) 
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sequencing of RNA, the synthesis of RNA probes, and 
amplification techniques allowing in particular the 
detection and quantification of RNA. 

The known natural RNAdRNAp are not suitable for 
5 use as polymerases in several applications because they 
have acquired a strong discriminatory capacity wxth 
respect to their specific RNA template. Furthermore, 
these enzymes have not been well characterized. For the 
majority, they form supramolecular complexes composed 
10 of both viral and cellular factors; these complexes, 
which are generally associated with the membranes, are 
difficult to purify and are unstable during isolation 
(B N. Fields, D.M: Knipe, Virology, Vols 1 and 2, Raven 
Press, New York, (1990); G.P. Pfeifer. R. Drouin, 
15 G.P. Holmquist, Mutat. Res. Fundam. Mol . Mech. Mutagen. 

288, 39 (1993) ) . 

Few RNAdRNAp have been cloned, sequenced and 
expressed. The enzyme QP replicase is the best 
characterized. This enzyme is composed of 4 subunits, 
20 of which 3 are host factors (M. Kajitani, A. Ishihama, 
•Judaic Acids Res. 19, 1063 (1991)). The QP enzyme has 
b^n isolated; it shows good processability and is 
capable of carrying out cyclic reactions (P.M. Lizardi, 
C E Guerra, H. Lomeli, I. Tussie-Luna, F.R. Kramer, 
25 Bio/Technology 6, 1197 (1988)). However, this enzyme 
remains very limited in its applications because xt 
recognizes as template only a restricted class of 
highly structured RNA molecules (V.D. Axelrod, 
E. Brown, C. Priano, D.R. Mills, Virology 184, 595 

30 (1991) ) - ... 

Another RNAdRNAp has been partially 

characterized. It is an enzyme from the Saccharoses 
cerevisiae L-A virus. This polymerase, which has been 
cloned, requires assembling of the viral particle; xt 
35 binds first of all to the plus RNA strand, and then 
induces the assembly of the proteins of the partxcle 
(T. Fujimura, R. Esteban, L.M. Esteban, R.B. Wickner, 
Cell 62, 819 (1990)). At least three factors are known 
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to cosine with the viral particles. These factory are 
necessary for the replication o£ », for — t » 
and for the coherent maintenance of the particle (T. 



Fuiimura. R.B. Wickner, «olec. Cell. Biol. 7, 420. 
(1 87)) . studies in vitro have shown that an .ntac, 
viral particle is necessary for the synthesis of the 
minus strand .replication, - - = *^ 



minus strand irepi^-"- 

Cell 55. 663 (1988,) and for the synthesxs of the plus 
strand (transcription, (T. Fujimura. R.B. Wickner J- 
Biol a». 264. 10872 .1989,,. Thus, the complex of 
this' system does not make it easily adaptable to an - 
vitro transcription system. Furthermore, 3 ust Irk the 
0P system. this system is very d.scr.mxnatory . 
accepting only « and L-A viral RNAs as ^template ,T. 
Fujimura. R.B. Wickner. Cell 55. 663 (1988,,. 

' An RNAdRNAp for which a broad template 

accepting capacity has been shown is the enzyme ,otJ*e 
poliomyelitis virus <J. Plotch. 0. Palant. Gluzman 
I Virol 63. 216 (1989)). However, several problems 

C v<5t-em' the priming is 
exist with this enzymatic system. 

pendent either on an unidentified ^ ^J^ 
tbe addition of a poly.U, J^^^^ 

given that priming with a poiy^u, y 
^ . . 4->io t.emDlate, many 



not seiecuvc w^-^ 

5 products of different si.es are ^thesxzed_ 

Particular products having twice the length of the 
template. Furthermore, the sequential synthesis of the 
P l" and minus strands has not been demonstra d 
Plotch. 0. Palant, V. Gluzman. J- Virol. 63. 216 
,0 (1989), T.D. Hey, O.C. Richards, E. Ehrenfeld, 

d 61. 802 (1987,, J.M. Lubinski . L.J. R-sone. 
A. Dasgupta. J . Virol 61, 2997 (1987),. 

Among the DNAdRNAp polymerases, the enzymes of 
the T3 and T7 bacteriophages are capable of using RNA 
35 as template under particular conditions For example^ 

t-^n^rribe a single-stranded RNA 
the T3 DNAdRNAp may transcribe a 

'template (i.e. the mes senger « £ 
resistance to neomycin) if it is i y 
. REPLACEMENT SHEET (RULE 26) 
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antisense strand of the T3 promoter including the 
sequence for initiation from + 1 to + 6 (S. Leary, 
H J Baum, Z.G. Loewy, Gene 106, 93 (1991)). It is also 
known that the T7 RNA polymerase can transcribe, from 
one end to the other, an RNA template in the absence of 
the promoter sequence (M. Chamberlin, J. Ring, J- Biol. 
Chem 248, 2235 (1973)). Furthermore, it has been shown 
that the T7 RNA polymerase can efficiently transcribe 
two small specific RNA templates, the "X" and «Y« RNAs, 
producing both plus and minus RNA copies. This 
replication, which is obtained in the absence of a 
consensus promoter sequence, appears to be dependent on 
the presence of a specific secondary structure 

c a qharo Cell 57, 423 (1989), 
(M.M. Konarska, P. A. Sharp, ^ix 

15 M.M. Konarska, P.A. Sharp, Cell 63, 609 (1990)). On the 

other hand, the »X« and «V RNAs are not replicated by 

the T3 RNA polymerase, and it is not known if this 

enzyme is not capable of replicating highly structured 

RNAs or if the sequence specificity of this enzyme 

prevents its recognition of the «X- and «Y<< RNAs. In 

.he absence of a promoter, it has also been shown that 

fche T7 DNAdRNAp was capable of carrying out the 

extension of two overlapping RNA strands in antisense 

(C Cazenave and O.C. Ulhlenbeck, Proc. Natl. Acad. 

25 Sci USA 91, 6972 (1994)). Likewise, it has been shown 

that the wild-type T7 DNAdRNAp is capable of carrying 

out the extension of an RNA primer on a single-stranded 

DMA template, in the absence of a promoter (S.S. Daube 

andP.H. vonHippel, Biochemistry 33, 340 (1994)). 

The best characterized bacteriophage enzyme is 

the T7 RNA polymerase, a monomeric enzyme of 98 kDa 

(B . A. Moffatt, J.J- Dunn, P.W. Studier, J- Mol . Biol 

173, 265 (1984)). This monomeric polymerase has all the 

essential properties of an RNA polymerase, that is to 

• i. • ~-f a nromoter, initiation of 

35 say recognition of a promouei., 

• 4--„r, extension and termination 

transcription, extension 

(M. Chamberlin, T. Ryan, The Enzymes XV, 87 (1982)). 
Furthermore, the catalytic activity requires few 
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elements. namely a template. ribonucleos.de 
triphosphates and the divalent «g» ion. and it does not 
req uire any auxiliary protein factor for the patron 
or tarnation of transcription, unlike the other RKA 
5 polymerases (M. Chamberlin, T. Ryan, The Enzymes XV, 87 

(1932) ) . _ ... 

Mutagenesis of the T7 RSA polymerase gene has 

made it possible to identify and to define regions or 
residues involved in the polymerase function. A 

«;= atrateov has consisted in exchanging 
10 mutagenesis strategy _ lnsa 
elements between the T7 RNA polymerase and its close 
relative the T3 RNA polymerase whose ammo acid 
sequence is 82% identical (K.E. Joho. L.B Gross, 
N.J. McGraw. C. Raskin, W.T. McAllister, J. Mol. B.ol. 
15 215, 31 (1990)). This strategy has led to the 
identification of polymerase elements involved in the 
recognition of the promoter. It has been shown, for 
example, that the substitution of a single amin , . acid 
in the T3 (or T7) enzyme allows the mutated enzyme to 
20 spe cifically recognize .he heterologous T 7 <««, 

same manner, reciprocal substitutions in the respective 
promoter sequences confer on the mutated promoter the 

■ -Kir i-Vie> heterologous enzyme 

25 capacity to be recognized by the neter y 

r Diaz K Joho, W.T. McAllister, J- 
(C.A. Raskin, G. Diaz, r^. 

Mol. Biol. 228, 506, (1992) ) . 

T7 RNA polymerase has been crystallized and its 
structure determined at a resolution of 

v T Chunq J. P. Rose, B.-C. Wang, Nature 
^0 (R. Sousa, Y.J. cnung, u 

" 364 593 (1993)). From this structural study, sequence 

,v f Joho L.B. Gross, N.J. McGraw, 
alignments (K.E. Jono, n 

C Raskin, W.T. McAllister, J ■ Mol. Biol. 215, 31 
(1990); S. Mungal, B.M.Steinberg, L.B. Taichman, J. 
35 Virol. 66, 3220 (1992), W.T. McAllister, C.A.Raskin, 
Molec. Microbiol. 10, 1 (1993)) and mutagenic studies 
(D. Patra, E.M. Lafer, R. Sousa, J. Mol. Biol. 224 307 
{19 92); L. Gross, W- J . Chen, W.T. McAllister, J ■ Mol. 
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Biol 228, 1 (1992)), it has been possible to correlate 
the functional elements of the T7 enzyme with the 
structural element. T7 RNA polymerase may be divided 
into two functional domains: a promoter recognition 
domain and a catalytic domain (R. Sousa, Y.J. Chung, 
J. P. Rose, B.-C. Wang, Mature 364, 593 (1993), 
W.T. McAllister, Cell. Molec. Biol. 39, 385 (1993)). 

The T7 RNA polymerase asparagine 7 48 has been 
shown to interact with nucleotides -10 and -11 in the 
promoter sequence, an interaction shown to be 
responsible for the promoter specificity (C.A. Raskxn, 
G Diaz^K. Joho, W.T. McAllister, J. Mol . Bxol . 228, 
-5-06 J&S£d>) ■ The possibility that a sigma-type 
infraction between the T7 polymerase and its promoter 
can exist in the bacteriophage system has been 
mentioned. Indeed, a sigma-type sequence, corresponding 
to the 2.4 region of sigma, i.e. the region of sigma 
interacting with the "Pribnow box" ( TATAATG sequence 
recognized by the E.coli sigma 70 transcription factor) 
(C Waldburger, T. Gardella, R. Wong, M.M. Susskind, J. 
Mo' Biol. 215, 267 (1990); D.A. Siegele, J.C. xiu. 
W.A. Walter, C.A. Gross, J. Mol. Biol. 206, 591 
-1989)), exists in the N-terminal region of the T7 RNA 
polymerase between amino acids 137 and 157 (L . Gross 
W-J. Chen, W.T. McAllister, J • Mol. Bxol. 228, 
(1992)) Moreover, although it has not been possible to 
attribute any function to it, the 230 to 250 region 
exhibits sequence homologies with the E.colx 
repressor (McGraw, N.J., Bailey, J.N.. Cleaves, G.R. , 
Dembinski, D.R., Gocke, C.R.. Joliffe, L.K., 
Macwright, R.S. and McAllister, W.T. Nucleic Acids Res. 

13, 6753 (1985) ) . 

The catalytic domain consists of a pocket 
resulting from the bringing into close proximity of 
S several regions dispersed over the primary structure 
(R Sousa, Y.J. Chung, J. P. Rose, B.-C. Wang, Nature 
364 593 (1993), W.T. McAllister, C.A. Raskin, Molec. 
Microbiol. 10, 1 (1993); D.Moras, Nature 364, 572 
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(1993)). This pocket contains in particular several 
conserved motifs among which the A and C motifs are the 
best conserved in the polymerases (Poch, 0 
Sauvaget, I.. Delarue, M. and Tordo, N . EMBO J • 8, 3867 
(1989); Delarue, M. . Poch, 0., Tordo, N. and Moras, D. 
Protein Engineering 3, 461 (1990); W.T. McAllister 
CA Raskin, Molec. Microbiol. 10, 1 (1993)). A thxrd 
motif, the B motif, is conserved in DNA-dependent RNA 
and DNA polymerases whereas a B' motif which xs 
different (both for the sequence and for the apparent 
structure) exists in the RNA-dependent RNA and DNA 
polymerases (Poch, 0., Sauvaget, I., Delarue, M. and 
Tordo, N. EMBO J".' 8, 3867 (1989); Delarue, M. , 
Poch 0., Tordo, N . and Moras, D. Protein Engineer xng, 
3 461 (1990); L.A. Kohlstaedt, J. Wang, J.M. Friedman, 
p'.A Rice, T.A. St.it, Science 256, 1783 (1992); 
W.T. McAllister, C.A. Raskin, Molec. Microbiol. 10, 1 

(1 " 3)> 'one of the aspects of the present invention is 
based on the discovery that certain mutated DNA- 
dependent RNA polymerases are capable of transcribing a 
single-stranded or double- stranded RNA in the presence 
of a double-stranded DNA promoter. Furthermore, these 
mutant enzymes are not very capable or are i»«P*£ of 
3 transcribing single-stranded or double- stranded 

the presence of a double- stranded DNA promoter. They 
are therefore preferably or strictly RNA-dependent. 
Their use is particularly advantageous in cases where 
it is desired to selectively transcribe the RNA, xn 
0 particular when the starting biological sample contaxns 
or risks containing DNA having a sequence identxcal or 
similar to that of the RNA to be amplified. 

The subject of the invention is therefore an 
RNA polymerase capable of transcribing a polynucleotide 
,5 segment of interest of any sequence contained xn a 
polynucleotide template, by synthesizing, xn the 
presence of said template, and under the control of a 
promoter, a product of transcription containxng an RNA 
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sequence complementary to the sequence of saxd 
polynucleotide segment of interest, characterized xn 
that it is capable of synthesizing said product of 
transcription with a better yield when said sequence of 
interest contained in the template consists of RNA than 
when said sequence of interest contained xn the 
template consists of DNA. 

The invention relates in particular to an RNA 
polymerase defined as above such that the ratio of the 
D yield of product of transcription of a DNA template to 
the yield of product of transcription of an RNA 

j ■ cl i~ less than 95%, especxally 
template, expressed xn %, xs less tnan 

less than 85% and in particular less than 70%. 

The subject of the invention is in partxcular 
5 an RNA polymerase as defined above, characterized xn 
that the ratio of the yield of product of transcrxptxon 
of the RNA template to the yield of product of 
transcription of the DNA template is at least equal to 
2 and in particular at least equal to 10. 
0 The "yield" of transcription is the molar ratio 

of the quantity of product of transcription to the 
quantity of polynucleotide template present at the 
origin. This yield may be easily determined 
experimentally, by introducing into the reaction medxum 

t-he oolvnucleotide template. 
25 a determined quant xty of tne poxynu. 

For comparison of the yields obtained with a DNA 
template and an RNA template, conditions other than 
those of the nature of the template must obviously be 

comparable . t 

The RNA polymerase of the invention xs capable 
of transcribing a polyribonucleotide template of any 
sequence, and it differs from Q P -replicase in thxs 
respect. It preferentially or exclusively transcrxbes 
an RNA template, and it differs from known phage DNA- 
35 dependent RNA polymerases in this respect. 

The RNA polymerases of the invention, unlike 
the known natural RNAdRNAp polymerases, are xn 
particular RNA polymerases capable of functioning 
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without associated protein cofactor(s). They may 
however be provided in the form of mul timers, and in 

particular of dimers. 

The mutated RNA polymerases of the invention 
5 are therefore generally obtained from RNA polymerases 
which are themselves capable of functioning without 

protein cof actors . 

The RNA polymerases of the invention may be m 
particular RNA polymerases which are derived by 
10 mutation from a virus or phage DNA-dependent RNA 
polymerase, and in particular from a DNA polymerase of 
an E.coli phage. Among the E.coli phages, there may be 
mentioned in particular T3 , T7 and SP6 . 

An RNA polymerase according to the invention 
15 may possess a protein sequence homology greater than 
50%, and in particular greater than 80% with a wxld- 
type RNA polymerase of the family of DNA-dependent RNA 
polymerases including the T7 RNA polymerase, T3 RNA 
polymerase and SP6 RNA polymerase. 

The abovementioned family of DNA-dependent RNA 
polymerases is known; see for example the article by 
R. Sousa, TIBS 21, 186-190 (1996), and the references 

cited in chat article. 

Among the polymerases of the invention, there 
25 may be mentioned in particular those which contain at 
least one mutation in a region corresponding to the T7 
RNA polymerase sequence containing amino acids 625-652, 
and in particular those which have the composition of a 
wild-type DNA-dependent RNA polymerase, with the 
3 0 exception of the fact that they contain at least one 
mutation in said region. "Mutation" is understood here 
to mean the replacement, deletion or insertn.cn of an 
amino acid. 

There may be mentioned for example the RNA 
35 polymerases containing at least one mutation at a 
position corresponding to one of positions 627, 628, 
631, 632 and 639 of the T7 RNA polymerase amino add 
sequence; in particular said mutation may comprise the 
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replacement of an amino acid residue, chosen from 
arginine, lysine, serine and tyrosine, of the wild-type 
RNA polymerase with another amino acid residue. The 
amino acid replaced is for example an arginine or a 
5 lysine. The replacement amino acid may be chosen m 
particular from alanine, valine, leucine, isoleucine, 
glycine, threonine or serine. It is understood that the 
expression "amino acid" designates here, by a misuse of 
language, an amino acid residue engaged in a pepti.de 
10 bond. 

Reference was made above to the pepto.de 
sequence of the T7 RNA polymerase. The numbering of the 
amino acid residues adopted here is that described by 
Dunn, J.J. and Studier, F.W. J. Mol. Biol. 148(4), 303- 
15 330 (1981), and by Stahl, S.J. and Zinn, K. , J. Mol. 
Biol. 148(4) , 481-485 (1981) . 

The invention also relates to: 

- a gene encoding an RNA polymerase as defined 
above: such a gene may be obtained for example 

2 0 according to a method similar to that described below 
in the experimental part; 

- an expression vector into which such a gene 
is inserted, said vector being capable of expressing 
said RNA polymerase in a host cell; this vector may be 

2 5 obtained in a manner known per se; 

- a host cell containing such a vector. 

The invention also relates to a method of 
producing an RNA polymerase as defined above, 
characterized in that: a) a gene encoding a wild- type 

30 RNA polymerase is obtained in a known manner, b) at 
least one mutation is performed on said gene, c) the 
mutated gene obtained is inserted into an expression 
vector, d) said vector is expressed in a host cell m 
order to obtain a mutated RNA polymerase and e) among 

35 the mutated RNA polymerases obtained, those which 
exhibit at least one of the properties of an RNA 
polymerase as defined above are selected. 
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A more detailed description of a particular 
embodiment of the method of the invention will be given 
below in the case of the use of the TV RNA polymerase 

as starting material. 

A modular gene for T7 DNAdRNAp was prepared, 
this gene resulting from the assembly of different 
cassettes (see Example 1 and Figure 1) . 

The modular gene thus defined is characterized 
in that it contains 10 cassettes bordered by unique 
restriction sites in the cloning vector. 

in particular, these cassettes, bordered by 
unique restriction sites, are characterized in that 
each cassette comprises a region of interest, _n 
particular those involved in promoter recognition 
(region exhibiting homology with the E.coH a factor; 
region exhibiting homology with the E.coli X -pressor; 
. egion conferring promoter specificity) and those 
Involved in the catalytic site (motif A; motif B; motxf 

C> ' For the definition of motifs A, B and C. see 

fo- example R. Sousa, TIBS 21, 186-190 (1996). 

These cassettes, derived from the T7 DNAdRNAp 
gene 1. were obtained with the aid of conventional 
molecular biology techniques (P.M. Ausubel R. Brent 
r E . Kingston, D.D. Moore, J.G. Seidman, et al . Current 
Protocols in Molecular Biology (Current Protocols. 
1993)), in particular PGR, which made it possible to 
introduce restriction sites by silent site-ducted 
mutagenesis, and subcloning into a cloning vector. 
> The modular gene thus obtained is characterized 

by the presence of restriction sites bordering the 
cassettes, these restriction sites being the Nco I (-2 
+ 4) Bel I (218, 223), Hind III (539, 544), Sad (776, 
78 l') PstI (1587, 1592), Bglll (1811. 1816), Ndel 
5 (1865, 1870), Xhol (1951, 1956), Clal (2305, 2310), 
Sail (2497, 2502), Xbal (2660, 2665) site; position 1, 
in nucleic acids, corresponds to the adenine of the 
initiator ATG codon and position 2652 to the third base 
REPLACEMENT SHEET (RULE 26) 



- 21 - 



of the TAA terminator codon. The Ndel site (2504, 2509) 
was destroyed. All the mutations inducing these 
restriction sites are silent, except the mutatxon 
generating the Ncol site which induces the replacement 
5 of asparagine at position + 2 with a glycine. Posxtxon 
1 in amino acids, corresponds to the first methxonxne 
position 883 corresponds to the carboxy- terminal 

alanine. t 

The modular gene, cloned into a clonxng vector 
10 pGNEX derived from pGEM-1 in which the polylinker has 
been replaced by an adaptor containing the Nco I, 
EcoRI, Xbal restriction sites constitutes the basxc 
support for subsequent mutageneses. This is made 
possible by the fact that each cassette containing a 
15 region of interest is bordered by unique restrxctxon 
sites in the cloning vector. > 

?he introduction of nonsilent mutations, wxth 
<%Vhe aid k PGR techniques, into one or more cassettes 
^ of the mcLlar — Previously defined, led to genes 
20 encoding X-r.^- exhibiting an amino acid sequence 
differing bAat least one amino acid wxth respect to 
th . T 7 expressed from the modular gene. Mutant genes 
were in particular prepared which encode at least one 
modified amino \cid in the B motif of the wxld-type 
25 enzyme, for exarAe with an alanine (A) in place of an 
arginine (R) at position 627 and/or an alanxne A) ,n 
place of a serine \> at position 628 and/or an alanxne 
(A) in place of a l\in. (A) at position 631 and/or an 
alanine (A) in place \ an arginine <R) at posxtxon 632 

/A\\nr 1 nlace of a tyrosine (Y) at 
30 and/or an alanine (A) yn place x 

position 639. \ 

Mutant genes were also obtained which ^ CO (%%£ X J> fijO; 
polymerase whose 625VTRSVTKRSVMTLAYGSKEFGFRQQVLD652 ^ 
region comprising the B motif has been replaced as a 
35 whole or in part by the homologous region B' present xn 
some RNA-dependent polymerases, in particular those of 
the polymerases of the hepatitis C vxrus 
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(K C OTKR CH A SOV LT rSC SN T^xf &g ^e^east intense 
32 (HNTTLGIPQGSWSPILCNIFLDKCT. 

The genes described above have been cloned into 
a vector pMR resulting from the ligation of the Sspl 
fragment of pMal-c (Biolabs) containing in P-t-ular 
the lad' repressor, and of the Sspl fragment of P MH 
(V. Cheynet, B. Verrier, F. Mallet, Protein expression 
and purification 4, 367 (1993)) containing a 
udnici.tron making it possible to achieve a high level 
of expression, as well as a sequence encoding a poly- 
histidine tail fused with the terminal end of the 
cloned gene (Example 1. Figure 2). The expression of 
the recombinant proteins T7 RNAp into the bacterial 
strain BL21 represents up to 30% of the total proteins 
of the bacterium. The proteins solubilized m a 

v^i-^ buffer containing a high salt 

deoxy cholate DUltei n 

ccc.ntr.tion are deposits* on a TM*. column 
(Clontech, allowing specific purificat.on by chelat on 
„ith the ion of proteins having a poly-histid.ne u l 
13 0 to 2200 „ of polymerases are thus 
20 T.1 of culture, with a purity greater than 95% as 
indicated <i> by a coomassie blue staining (Example 1 
Figure 3,. (11) by a Western-blot analyses w.th^ a 
guinea-pig polyclonal antibody antr-T7 KM 
<bio M erieu£ an, a youse monoclonal antibody (Quragen, 
lti-<n»% pt&Vby the absence of endonuclease. 
single-stranded* and double-stranded exonuclease or 

. ... Hptermined essentially 

ribonuclease activity, as determine 

according to the method of He et , <B ..He M 

D.L. Lyakhov. H. Gartenstein, G. Diaz, et a 

Expr Purif 9. 142 (1997)). This result reflects the 

performances of the host-pMR-BL21 vector pa ir and of 

the method £ ^ " 

MRGSHHHHHSVLE tail. y 
; The subject of the invention is also the use of 

an EN* polymerase as defined above, in a * 
transcription of a polynucleotide segment of mt rest 
Ling any sequence, said segment, of the Km type, 
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being contained in a polynucleotide template, so as to 
synthesize, in the presence of said template, a product 
of transcription containing an RNA sequence 
complementary to the sequence of said polynucleotide 

5 segment of interest. 

According to a particular embodiment, said use 
is characterized in that said polynucleotide template 
comprises, upstream of said polynucleotide segment of 
interest, a promoter recognized by said RNA P°ly~»«^ 
10 and in that said product of transcription is an RNA 
complementary to a sequence of the template starting at 
a site of initiation of transcription for said 
promoter . 

The RNA polymerases of the invention may be 
15 used in particular to carry out (i) an amplification of 
an RNA target isothermally . (ii) a direct sequencing of 
RNA and (iii) the synthesis of RNA of special interest 
(for example probes, ribozymes and the like). In 
. addition, RNA polymerases of the invention are capable 
,0 of incorporating modified bases into the newly- 
synthesized strand, which facilitates in particular the 
quantification or the use of said strand. 

The invention relates in particular to the use 
of these recombinant enzymes thus expressed and 
25 purified in a method of synthesizing RNA from an RNA 
template, under the control of a promoter. 

The enzymes thus purified were evaluated, in a 
promoter-dependent context, on different templates 
(Example 2, Figure 4 on the one hand, and Example 3 
30 Figure 6, on the other hand) in particular a template 
containing a single-stranded RNA. It has been shown 
that a mutated polymerase obtained according to the 
invention was capable of generating a specific 
transcript of the correct size in particular on a 
35 single-stranded RNA template. In Example 2, it is 
indicated that the wild-type enzyme identically 
produced does not appear to be able to carry out this 
phenomenon. However, Example 3 shows that the wild-type 
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enzyme in fact possesses the property of transcribing 
an RNA template. These apparently divergent results are 
explained by the fact that the experimental conditions 
in these two examples are different. Indeed, in Example 
2 the presence of the transcript is identified by the 

. ■ o Tirnp labeled with 

technique of incorporating a UTP, i 

radioactive phosphorus, whereas in Example 3 the 

technique used is Northern blotting for the group 2 

templates, and in the latter case, the detection of the 

transcript by the Northern blotting technique is 40 

™«r-*» sensitive than the detection by 
times more sensitive 

incorporation of radioactive phosphorus. Example 2 
below therefore shows that a mutated polymerase 
obtained according to the invention is capable of 
generating a specific transcript of the correct size on 
a single-stranded RNA template, and Example 3 shows 
that in fact the corresponding wild-type polymerase is 
capable of generating a specific transcript of the 
correct size on RNA templates independently of their 



2 0 sequence. * _ 

Furthermore, such a mutated polymerase is 

incapable, unlike the wild-type polymerase, of 
generating a transcript of the correct size on a 
single-stranded or double-stranded DNA template. If the 
25 Mg 2+ ion present in the reaction medium is replaced by 
the Mn 2 * ion, the mutated enzyme, on a single- stranded 
RNA template, does not generate under these conditions 
a specific transcript of the correct size, but is 
nevertheless capable of generating large quantities of 
30 abortive products. Such a mutated polymerase is xn 
addition capable of displacing an RNA /RNA hybrid, 
in the appended drawings: 

Figure 1 illustrates the amplification 
strategy, described in detail in Example 1 below, for 
35 the construction of a modular gene for the T7 RNA 
polymerase, 

- Figure 2 illustrates the structure of the 
expression vector pMR containing this modular gene, 
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- Figure 3 represents electrophoretic profiles 
of the Proteins expressed with the aid of this vector, 
as described in detail in Example 1 below, 

_ Figure 4 schematically represents the 

-in the transcription trials of 
template systems used in the 

Me l^ e S repress the electrophoretic 

^■f hranscription obtained in 
profiles of the products of transcript 

Example 2 below, and 

. Figure 6 schematically represents the 
template systems used in the transcription trials o£ 

EXamPle ^""following examples illustrate the 



invention . 



EXAMPLES 

for T7 KNA 



) 



Example 1- can-tructis^^ 

purjLfication^ 

The T7 RNA polymerase gene was constructed in 

* ™ takincr into account the regions of 
modular form, taking inu 

homology with the other polymerases as well * 

unctions associated with certain domains o J ™ 

Polymerase. It was divided into 10 regions or ~sse«es 

^Figure 1-A) . each region being delimited by unique 

estriction sites in the cloning vector. Si, cassettes 

haracteristic in that they contain respectively a 
are characteristic s 

domain having similarity with part o£ tne 
factor, a domain having homology with the E.coli lambda 
repres °r. * domain involved in the phage polymerase 
repressor, motifs conserved in 

promoter specificity, the A and C m 

the template-dependent polymerases and the B motif 
^served in the ^-dependent polymerases. E ach reg on 
is amplified by PCK (F.H. Ausubel , R - Brent 

. n n Moore J.G. Seidman, et al , Current 
R.E. Kingston, D.D. Moore, J , Current Protocols, 

Protocols in Molecular Biology (Current 
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19 931) (Figure la-Bl) from the wild-type gene (gene 1) 
using primers defined as follows: (i) these primers 
contain silent mutations allowing the introduction of 
restriction sites delimiting the cassettes; lu) the 
primer situated at position 3- of one region and the 
primer situated at position 5- of the contiguous region 
partially overlap. The contiguous amplication 
products are mixed and reamplified using external 
primers (Figure la-Bl) . The 5 fragments generated are 
cloned into the vector PCP.II (mvitrogen) (Figure 1- 
B2) From the 5 cloned fragments, the T7 RNA polymerase 
g ene was reconstructed from the 5' end to the 3 • end by 
successive subclones into the cloning vector pGNEX 
(Figure 1-B3) . This vector is derived from the plasmid 
pGEM-1 (Promega) in which the polylinker is replaced by 
an adaptor containing the N =oI-Eco R I-*>aI 
sites The modular gene for T7 RNA polymerase (T7 RNA 
pol) thus obtained contains 17 unique cloning sites rn 
p GNS X of which 11 newly-introduced sites (Figure 1-C>. 
This gene was then subcloned into the expression vector 
(Figure 2,. This vector, derived from the vector 
P Z <V Cheynet. B. Verrier. F. Mallet. Protein 
egression and purification 4, 367 ,1993,). contains 
the strong tac promoter regulated by IPTG (isopropyl- 
beta-D-thiogalactopyranoside) . a short minicistron 
surrounded by two ribosome-binding sites constituting 
an efficient context for initiation of translation and 
a polyhistidine tail fused with the N-terminal end of 
the expressed protein allowing its purification by 
, metal ion-affinity chromatography. The gene encoding 
the lad' repressor allowing a better control of 
expression is also present in pMR. This expression 
vector is transformed according to conventional methods 
,PM. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, 
5 j.;. seidman, et al , Current Protocols in Mo ecular 
Biology (Current Protocols, (1993)1 in the B.coli BL21 
bacterial strain. A preculture is used to inoculate 
30 ml of culture medium. The expression of the proteins 
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is induced by IPTG for 4 h at 37°C as soon as the 
optical density at 600 nm of the culture reaches 0.6. 
The T7 RNA pol thus expressed represents 30% of the 
total bacterial proteins (Figure 3). It is then 
extracted from the bacteria by lysis (sonication) m 
the presence of a detergent. From the soluble extracted 
fraction, the T7 RNA pol is purified by Co + -ion 
affinity chromatography and eluted with an imidazole 
gradient. It is analyzed on an SDS-PAGE gel and 
visualized by Coomassie blue staining (Figure 3). The 
absence of degradation products is verified by Western 
blotting with a polyclonal antibody anti-T7 RNA pol 
oroduced in guinea-pigs and a monoclonal antibody anti- 
MRGSHe which is commercially available (Quiagen) . The 
absence of spurious endonuclease, exonuclease and RNase 
activities is verified according to the protocol 
described by He et al. Protein Expr Purif 9, 142 
(1997)). From 20 ml of culture 700 to 800 Mg of 
-nzymes, having a purity greater than 95%, are obtained 
in a reproducible manner. The purification -yield is. 

75%. . . 

The ooint mutations are created by sequential 

PGR (F.M. Ausubel, R. Brent, R.E. Kingston, D.D. Moore, 
j G . Seidman, et al, Current Protocols in Molecular 
Biology (Current Protocols, (1993)) (or double PCR) , in 
the same manner in which the restriction sites were 
introduced: the internal primers contain the mutation 
to be introduced; the external primers flank the region 
to be mutated and contain the restriction sites 
delimiting this region in the modular gene. This 
cassette thus mutated is then cloned into the modular 
gene. Synthetic oligonucleotides may also replace a 
complete cassette. 
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t. 0 ^H to the f^^s mentionpd in Example 1: 

Figure 1: illustrates the amplification 
strategy for the construction of the modular gene for 

Division of the T7 RNA 
5 the T7 RNA polymerase. (A). Division 

polymerase gene (vertical arrows) with respect to the 
regions of interest. The regions marked from 1 to 6 
are: 1, homology with part of the E.coli sigma factor; 
2 homology with the E.coli lambda repressor; 3, motif 
10 A . 4, motif B; 5, phage polymerase promoter 
specificity; 6, motif C. (») = <1> Position of the 
primers containing the silent mutations for the 
creation of the restriction sites (-Q ); Nc, Ncol; Be 
Bell; H. HindHI; Sc, Sad; P, P*"; Bg, Bglll; Nd. 
15 Ndex; Xo, Xhol; C Clal; si. Sail; Xb, Xbal . Extension 
by overlapping PGR leads to 5 fragments containing 3 

to F5) . Sub-cloning means: 
restriction sites (Fl to taj 

subcloning. (2) Each fragment thus generated (Fl to F5 
is cloned into the vector pCRII. E, EcoRI . (3) 
Reconstruction of the modular gene from the 5- end to 
the 3' end, using at each step the restriction site 
common to 2 fragments and the EcoRI restriction site of 
the vector pCRII and of the cloning vector pGNEX ; the 
Xbal restriction site common to the last fragment and 
to the vector pGNEX is used for the last subcloning. 
(C) structure of the modular gene for T7 RNA polymerase 
and position of the unique restriction sites in the 
vector PGNEX which already exist (at the bottom) and 
created by mutagenesis (at the top) . 

Figure 2: illustrates the structure of the 
expression vector pMR containing the modular gene for 
T7 RNA polymerase. Ptac : tac promoter (black box); 
RBS1-MC-RBS2 , minicistron surrounded by 2 ribosome- 
binding sites (white arrow); (His) 6T7RNAPcas modular 
35 gene for T7 RNA polymerase (gray arrow) ; rrnB Tl T2 
ftrong transcription terminators (box with dotted 
lines); bla. gene for resistance to ampicillm (black 
arrow); pMBl ori/M!32 ori-, replication origins (thm 
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white box); lad*, gene encoding the lacl« repressor 
(hatched arrow); some restriction sites are indicated, 
including the new sites introduced by mutagenesis 

(underlined) . 

Figure 3: shows the electrophoretic profiles 

obtained allowing analysis of the expression and of the 
purification of the T7 RNA polymerase. The bacterial 
lysates are prepared from 1 ml collected from 30 ml of 
culture in order to verify the expression of T7 RNA 
Polymerase in the presence ( + ) or absence (-) of IPTG. 
The T7 RNA polymerase is extracted from 20 ml of the 
same culture. The soluble fraction is loaded onto the 
purification column- (lane L) and the T7 RNA polymerase 
<s eluted (lane E) with an imidazole gradient. M, 
molecular weight marker. The proteins are visualized on 
a 10% SDS-PAGE gel by Coomassie blue staining. The 
arrow indicates the position of the T7 RNA polymerase. 

Example 2: Trans^ip^io^^ templates 

hy i-h* mutated T7 RNA polymerase R627A under 
the control of a promote r 

The template systems used in this example are 
schematically presented in Figure 4. The single- 
stranded DNA transcription system comprises a template 
strand (a) of 50 bases having the sequence ^ & TbW. 2) 
3 • ATTATGCTGAGTGATATCCCAACCGGCGTCACAAGTGAGTACCAATACCG5 ^ 

and hybridized from positions -17 to +1 with the non- 
template Promoter ^tran^jb) having the sequence 
5 ' TAATACGACTCACTA^AG 3^ (blocked at its 3' end). The 
double-stranded DNA transcription system comprises the 
template strand (a) hybridized with the non-template 

strand (c) having the sequence „ £A TV NO'. fJl) 

5 ■ TAATACGACTCACTATAGGGTTGGCCGCAGTGTTCACTCATGGTTATGGC3 ^. ^ 

The single-stranded RNA transcription system 

consists of a DNA hybrid strand from positions -17 to 

-1 and RNA from positions + 1 to + 33 having the sequence 
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a 3 • ATTATGCTGAGTGATATCCCAACCGGCGUCACAAGUGAGUACCAAUACCG5 ' , 

9 hybridized «ith *e non-template propter strand (bK 

Te expected complete transcripts on the three system 

S^n, *" ° f Ve^acions are performed in 20 ,1 of a 
C / buffer derived from that described by J.F. Mrllrgan, 

( I » <JX O.W. Witherell. O.c. uhlenbec*. Nuclei 

Acids ReA 25, 87S3 ,198,,. namely Tris-HCl 40 «. 

V -/i-r,o 1 mM PEG 8% (g/V), trxton 0.01% 

v.h 8 1 speVmxdxne 1 nua, u 

1n !v/v : , BSA 5\, g /100 „1. 1 Ml (40 u) of porcine tmAguard 

Pha ma! a sYotech, . UTP 12.5 a 32P UTP 0 . 5 ,d 

ri bonucleoside\triphoshates A. G. C Mg.OAc,, 6 m« The 
template concentration is set at 10» copres ot each 
-i . . on \i of reaction. The wild-type T7 ema 

Q 15 strand rn 20 to of ^ ^ 

S .. V £0 " Sdding in t 

ia --™«, are denWed for 5 minutes at o5 C xn a 

^ block andlhen gradually brought to 37°C. The 

L 20 neating bl e inAiated by the addition of the 

reactions are * at 37 o C and then 

rt Polymerases, xncubatfed for J. no 

CO Looped by the addition of an equal volume of 2x blue 

stopped y v xylene cyanol 

formamide (formamxde >B0%, EDTA , * 

' lJ _ .» vA_ o 02%) and denatured for b 

,r a 02% bromophenol bl*e u.u^°> 

t- Q5°C 20 ul\of each reaction are deposxted 
minutes at 95°C. n± \« 

on a denature gel <20%\acrylamide. urea 7 M IX TBEK 
Ind then after migratioA the gel is autoradiography^ 
ana uicn » fTCoriak) The results 

at -70°C on a Biomax Mfe film (Kodak). 

• wvF-mmA are presented in Figure 5, 
30 (electrophoretxc prof xles)\ are p obtained 

• n «-v»o t-T-Anscription results oPtaxnea 
and in partxcular the transcrxp 

with the stated T7 RNA Polymerase K62 A ^ 
and the wild-type T7 PNA polUerase (wells 4-6, , o^t 
single-stranded RNA template! (wells 1 and 4,, double 

^ j n\ and sinale- stranded DNA 
35 stranded DNA (wells 2 and 5)V and smgl d 
(wells 3 and 6). The transcription on srngle-stranded 
Z. detected by detection of\a complete tra s t 

•vo«=> ncsinii the mutated 1/ mNrt 
33 bases, is possxble usxnf 
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polymerase R627A (well D and not the wild-type enzyme 
Aell 4) which produces on the other hand many abortive 
tXscripts; see nevertheless the different results 
oblined in Example 3 below. The mutated T7 RNA 
polLrase R627A exhibits a residual transcriptxon 
activity on double- stranded DNA (well 2), characterized 
by tVie presence of a predominant transcript which is 
smaller in size than the expected transcript, and the 
presence of a small quantity of abortive products. On 
singll-stranded DNA (well 3), this transcript of 
abnormal size disappears, whereas the quantity of 
abortile products increases. By contrast, the wild-type 
enzyme\allows the production of specific transcripts in 
the presence of DNA templates (wells 5 and 6), this 
enzyrnelxhibiting, moreover, a better transcription 
activity; on the double- stranded DNA template (well 5) 
than on\the single-stranded DNA template (well 6); for 
these tw\ templates, the wild- type enzyme induces the 
synthesisA of numerous abortive transcripts . These 
results slow that the replacement of the argmine 627 
fay an alWne confers on the mutant enzyme the 
Possibility of synthesizing RNA from an RNA template 
and induces\the loss of capacity to synthesize RNA from 
a DNA template. 

Example 3 : 

The T7 RNA polymerase obtained as in Example 1 
is purified by affinity chromatography according to the 
method described by Arnaud N. et al . , Gene, 199. 149- 
156 (1997). 

The purified T7 RNA polymerase has a speciric 

activity of about 200 U//xg- 

The sequences of the templates used for the 
transcription are described in Table 1 below. 

in the trials described below, the templates of 
sequence 1, 2 and 3 of Table 1 are called group 1, 
group 2 and group 3 templates, respectively. 
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The sequence of the probes used is represented 
in Table 1 (sequences Nos . 4 and 5) . 

Sequence No. 4 recognizes the 3' end of the 
products of transcription obtained with the templates 
of groups 1 and 3, and sequence No. 5 recognizes the 3 
end of the products of transcription of the group 

template. , 

The probes are labeled with y 32 P ATP usxng T4 
polynucleotide kinase or with a c^P ddATP using a 
deoxynucleotide terminal transferase. 

Transcriptional trials 

The reactions are carried out in a volume of 
20 ML containing • 40 m/M Tris- HCl, 1 *M spermidine 
50 Mg/ml of bovine serum albumin, 0.01% (v/v) Triton 
X100, 80 mg/ml PEG 8000, 1 Ml PNAguard (Pharmacia), 
5 mM magnesium acetate, 10 11 copies of template and non- 
template strands, the nucleoside triphosphates 
necessary for the transcription and the -cleosxde 
triphosphates labeled as indicated. When the labeled 
nucleoside triphosphate is «»P ATP, ™^ tl ~ j£ 
0 4 mM UTP, CTP and GTP, 12.5 mM ATP and 0.5 M Cx a P 
*TP 'New England Nuclear-Dupont , 800 Ci/mmol> are used. 
When the labeled nucleoside triphosphate is a P UTP, 

* n a mM ATP CTP and GTP, 12.5 MM of 
concentrations of 0.4 mM Air, L1 

ATP and 0.5 MCi a 32 P UTP (Amersham, 400 Ci/mmol) are 
used. When the labeled nucleoside triphosphate is 
Y 3 2p GT P, concentrations of 0.4 mM ATP, UTP, CTP and 
GTP, and 20 MCi Y 32 P GTP (Amersham, 5000 Ci/mmol) are 

USed - . ^ ^ -7HOP 

The samples are heated for 5 minutes at 70 C 
and then slowly cooled to 37 °C in order to allow the 
hybridization of the template and non- template strands. 
.60 ng of T7 RNA polymerase are then added and the 
lixture is incubated for 1 hour at 37°C. The reactions 
, are stopped by adding an equal volume of 2X bu fer (90% 
formamide, 25 mM EDTA, 0.02% xylene cyanol, 0 . 02 o 
bromophenol blue). The products of transcription are 
analyzed by electrophoresis, after heating for 
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minutes at 95°C, on a 20% denaturing polyacryl amide 
gel, and examined by autoradiography. 

For the Northern blot analysis, the reactions 
are carried out under the same conditions, but without 
5 the labeled nucleotides. After migration on a 20% 
denaturing polyacrylamide gel, the samples are 
transferred onto nylon membrane (Appligene) and the 
products of transcription are detected by the 
appropriate labeled probe and are examined by 
10 autoradiography. 

n^finition of sh ort synthetic templ ates 

Three types of short synthetic templates, 
15 containing a double- stranded DNA promoter, were defined 
in order to verify if the T7 RNA polymerase is capable 
of using an RNA template in a transcription reaction. 

The first type of templates (RNA+18) was 
defined so as to study the capacity of the T7 RNA 
,0 polymerase to transcribe an RNA template during the^so- 
called processive phase. This first type of tempxate 
comprises a double- stranded promoter followed 
downstream by a chimeric DNA- RNA sequence whose 
transition is situated 18 bases downstream of the site 
25 of initiation of transcription. 

The second type of templates ( RNA+ 1 ) was 
defined so as to study the capacity of the T7 RNA 
polymerase to transcribe an RNA template during the 
start of transcription phase. This second type of 
30 templates comprises a double- stranded promoter followed 

by an RNA sequence. 

The third type of templates (DNA) , which serves 
as a comparison, comprises a double-stranded promoter 
followed downstream by a DNA sequence. 
35 to study the influence of the non-template 

strand, the DNA, RNA+18 and RNA+1 templates may be 
either single-stranded (m) , or double- stranded 
heteroduplexes (bhe) , or double-stranded homoduplexes 
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(bho) . The double- stranded homoduplex template RNA+18 
forms an RNA-RNA duplex starting from position +18. 
Likewise, the double- stranded homoduplex template RNA+1 
forms an RNA-RNA duplex starting from the 
5 transcriptional start site. 

It should be noted that in Table 1, the 
different nucleotides have been represented with the 
letters A, T, C, G. When the template, or part of the 
template, is DNA, these letters represent 
10 deoxyribonucleotides . When the template, or part of the 
template, is in the form of RNA, these letters 
represent ribonucleotides and it should be understood 
that the symbol T is then used in place of the symbol 
U. 

15 The different transcription systems which have 

just been mentioned are schematically represented in 
the appended Figure 6 in which, for each double- 
stranded system, the top strand is the non-template 
strand and the bottom strand is the template strand. 
20 The DNA double- stranded promoter region is represented 
by a thick line. The DNA strands are represented by a 
thin line, the RNA strands are represented by a 
discontinuous line. The expression +1 corresponds to 
the transcriptional start site. The expression +18 
25 corresponds to the eighteenth base downstream of the 
transcriptional start site. In the representation of 
these various transcription systems, the letter m means 
single- stranded; the letter b means double-stranded; 
bhe means double- stranded heteroduplex; and bho means 
30 double- stranded homoduplex. 

The transcription systems of Figure 6 were used 
with each of the templates, or, depending on the cases, 
with some of the templates of Table 1. The group 1 
templates, using sequence No. 1, correspond to a 
35 double- stranded DNA promoter sequence followed by a 
consensus region of initiation +1 to +6 (Dunn and 
Studier, J. Mol. Biol., 166, 477-535, 1983), itself 
followed by a sequence of twenty- six bases downstream. 
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The group 2 templates correspond to a double- 
stranded DNA promoter sequence followed by a 
unfavorable sequence, since it can allow the early 
termination of transcription (Martin et al . . 
5 Biochemistry, 27, 3966-3974, 1988). 

The group 3 templates correspond to the same 
unfavorable initiation sequence as group 2, followed 
by the same downstream sequence as group 1. 

10 Results 

The results are summarized in the accompanying 

It is observed that the DNA-RNA transition on 
the RNA+18 template is efficiently passed via the 
15 elongation complex, as shown by the absence of an 
increase in early terminations around position +18. 
compared with the DNA control. 

The T7 RNA polymerase is capable of initiation 
of transcription on different RNA templates and is 
20 capable of fully transcribing these templates. The 
detection of transcripts having the expected size, for 
all the groups of templates, shows that the use of the 
RNA template is not dependent on the sequence, although 
the overall efficiency depends on the base composition, 
25 as is also observed on the control DNA templates. 

The different results mentioned in Example 
above result from the fact that the detection by the 
Northern blotting technique is in the present case 40 
times more sensitive than the detection technique used 
30 in Example 2, as was indicated above m the 
description. 

Th e T7 RNA polymerase is capable of 
transcribing an RNA -RNA duplex. No increase in the 
number of abortive products of transcription is 
35 observed with the RNA+1 homoduplex template. 

The transcription of the homo- or heteroduplex 
double-stranded templates is similar overall. In the 
case of the group 2 RNA+18 templates, more transcripts 
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of the expected size are obtained with the homoduplex 
system than with the heteroduplex system. 

The presence of the non-template strand 
influences the efficiency of transcription. Indeed, the 
yield of transcription is increased on the single- 
stranded templates, as shown, on the one hand, by the 
increase in the number of transcripts of the correct 
size and, on the other hand, by the fact that an 
initiation event is more frequently associated with the 
synthesis of a complete transcript. However, with the 
group 2 templates, the single- stranded system does not 

give better results. 

Thus, the- T7 RNA polymerase possesses 
transcriptional activity on RNA template, in the 
presence of a DNA double- stranded promoter. The yield 
of transcription observed is only 10 to 100 times lower 
than on DNA template. 
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TABLE 1 

SEQUENCE No. 1: 

3 • ATTATGCTGAGTGATATCCCTCTAGATCGTCGTATTGCGAGGTTCATGT 5 ' 
SEQUENCE No. 2: 

TCCCAACCGGCGTCACAAGTGAGTACCAATACCG 5' 



3 ' ATTATGCTGAGTGATA* 

SEQUENCE No. 3: 
3 ' ATTATGCTGAGTGATATCO 



+1 + * 8 



ICAACAGATCGTCGTATTGCGAGGTTCATGT 5 * 



+ 1 +1* 



SEQUENCE No . 4 : 

3' ATTGCGAGGTTCATGT 5* 

SEQUENCE No . 5 : 

3 * GIGftGTACCAATACCG 5' 
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